Toward Drawing an Atlas of Hypothesis Classes: Approximating a Hypothesis via Another Hypothesis Model
نویسندگان
چکیده
Computational knowledge discovery can be considered to be a complicated human activity concerned with searching for something new from data with computer systems. The optimization of the entire process of computational knowledge discovery is a big challenge in computer science. If we had an atlas of hypothesis classes which describes prior and basic knowledge on relative relationship between the hypothesis classes, it would be helpful in selecting hypothesis classes to be searched in discovery processes. In this paper, to give a foundation for an atlas of various classes of hypotheses, we have defined a measure of approximation of a hypothesis class C1 to another class C2. The hypotheses we consider here are restricted to m-ary Boolean functions. For 0 ≤ ε ≤ 1, we say that C1 is (1− ε)-approximated to C2 if, for every distribution D over {0, 1}, and for each hypothesis h1 ∈ C1, there exists a hypothesis h2 ∈ C2 such that, with the probability at most ε, we have h1(x) 6= h2(x) where x ∈ {0, 1} is drawn randomly and independently according to D. Thus, we can use the approximation ratio of C1 to C2 as an index of how similar C1 is to C2. We discuss lower bounds of the approximation ratios among representative classes of hypotheses like decision lists, decision trees, linear discriminant functions and so on. This prior knowledge would come in useful when selecting hypothesis classes in the initial stage and the sequential stages involved in the entire discovery process.
منابع مشابه
LINEAR HYPOTHESIS TESTING USING DLR METRIC
Several practical problems of hypotheses testing can be under a general linear model analysis of variance which would be examined. In analysis of variance, when the response random variable Y , has linear relationship with several random variables X, another important model as analysis of covariance can be used. In this paper, assuming that Y is fuzzy and using DLR metric, a method for testing ...
متن کاملA Dynamic Analysis of Market Efficiency on Benchmark Crude oil markets: Based on the Adaptive Market Hypothesis
This paper examines the applicability of the adaptive market hypothesis (AMH) as an evolutionary alternative to the efficient market hypothesis (EMH) by studying daily returns on the three benchmark crude oils. The data coverage of daily returns is from January 2th 2003 to March 5th 2018. In this paper, two different tests in the form of two distinguished classes (linear and nonlinear) have bee...
متن کاملAn analysis about urban creativity in Qazvin (Case study: Triple regions of Qazvin)
Creative city is a house for art creativities, scientific and technological innovations and clear voice of developing cultures; the city which fulfills all its creative potentials and is the pioneer of cultural and developmental activities. Qazvin is among the prone cities that doesn’t have a far distance to its prosperity of creativity. Therefore, this research tries to analyze the indicators ...
متن کاملTesting for Stochastic Non- Linearity in the Rational Expectations Permanent Income Hypothesis
The Rational Expectations Permanent Income Hypothesis implies that consumption follows a martingale. However, most empirical tests have rejected the hypothesis. Those empirical tests are based on linear models. If the data generating process is non-linear, conventional tests may not assess some of the randomness properly. As a result, inference based on conventional tests of linear models can b...
متن کاملInvestigating the Impact of Growth of Petroleum Products Consumption on Economic Development with a Systematic Dynamics Approach in Developing Countries
I n addition to labor force and capital, energy plays a significant role in the production of commodities and services. Energy is the driving force of production activities. Therefore, it is an essential ingredient of growth and development. Results obtained from this paper show that the growth of oil products consumption has a positive effect on economic development via two channels: Firstly,...
متن کامل